Peek Arc Consistency 

Manuel Bodirsky Hubie Chen 



Abstract 

This paper studies peek arc consistency, a reasoning teciinique thiat extends the well- 
known arc consistency technique for constraint satisfaction. In contrast to other more costly 
extensions of arc consistency that have been studied in the literature, peek arc consistency 
requires only linear space and quadratic time and can be parallelized in a straightforward 
way such that it runs in linear time with a linear number of processors. We demonstrate 
that for various constraint languages, peek arc consistency gives a polynomial-time decision 
procedure for the constraint satisfaction problem. We also present an algebraic characteriza- 
tion of those constraint languages that can be solved by peek arc consistency, and study the 
robustness of the algorithm. 

1 Introduction 

Background. A basic knowledge reasoning task that has been studied in many incarnations is to decide 
the satisfiability of given relationships on variables, where, for instance, variables may represent objects 
such as temporal events or spatial regions, and relationships may express precedence, containment, overlap, 
disjointness, and so forth. Instances of this reasoning task can typically be modeled using the constraint 
satisfaction problem (CSP), a computational problem in which the input consists of a set of constraints 
on variables, and the question is whether or not there is an assignment to the variables satisfying all of 
the constraints. While the CSP is in general NP-hard, researchers have, in numerous settings, aimed to 
identify restricted sets of relationships under which the CSP is polynomial-time decidable; we refer to sets 
of relationships as constraint languages. 

Arc consistency is an algorithmic technique for constraint satisfaction that has been heavily studied 
and for which highly efficient implementations that are linear in both time and space are known. Arc 
consistency provides a one-sided satisfiability check. It may detect an inconsistency, which always implies 
that the input instance is unsatisfiable. While the converse does not hold in general, it has been shown 
to hold for some particular constraint languages, that is, arc consistency provides a decision procedure 
for satisfiability for these languages. Examples include the language of boolean Horn clauses; various 
graph homomorphism problems, for example, homomorphisms to orientations of finite paths [11]; and all 
constraint languages where satisfiability is first-order definable O]- 

Curiously, arc consistency typically cannot be used as a decision procedure for infinite-domain con- 
straint languages, by which we mean constraint languages under which variables can take on infinitely 
many values. In many cases, a reason for this is that arc consistency performs inference by considering 
unary (arity 1) projections of relations, and all such projections are akeady equal to the full domain of the 
language. As an example, consider the binary relations < and ^ interpreted over the domain of rational 
numbers Q. For each of these relations, both of the two possible unary projections are equal to Q, and arc 
consistency in fact will not perform any inference. 

Path consistency is a more powerful algorithmic technique that provides a polynomial-time decision 
procedure for further finite and infinite domain constraint languages. However, the greater power comes 
at the price of worse time and space complexity: the best known implementations require cubic space and 
quadratic time. Unfourtunately, this makes the path consistency procedure prohibitive for many applica- 
tions where one has to deal with large instances of the constraint satisfaction problem. 

Peek arc consistency. In this paper, we study a general algorithmic technique for constraint satisfaction 
that we call peek arc consistency. Here, we describe the idea of the algorithm for finite-domain constraint 
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satisfaction, although, as we show in the paper, this algorithm can be effectively applied to infinite-domain 
constraint satisfaction as well. The algorithm performs the following. For each variable-value pair {x, a), 
the variable x is set to the value a, and then the arc consistency procedure is run on the resulting instance 
of the CSP. If there is a variable x such that for all values a the arc consistency procedure detects an 
inconsistency on (x, a), then the algorithm reports an inconsistency. As with arc consistency, this algo- 
rithm provides a one-sided satisfiability check. One might concieve of this algorithm as being a step more 
sophisticated than arc consistency; it invokes arc consistency as it takes a "peek" at each variable. 

Peek arc consistency has many practical and theoretical selling points. Arc consistency can be im- 
plemented in linear space, for any fixed finite-domain constraint language and many infinite-domain con- 
straint languages; the same holds for peek arc consistency. The time complexity of peek arc consistency is 
quadratic in the input size, which is still much better than the path-consistency algorithm, where the best 
known implementations have a running time that is cubic in the input size. Moreover, peek arc consistency 
can be parallelized in a straightforward way: for each variable-value pair, the arc consistency procedure 
can be performed on a different processor Hence, with a linear number of processors, we achieve a linear 
running time, for a fixed constraint language. We would also like to remark that implementing peek arc 
consistency is straightforward if one has access to an implementation of arc consistency as a subroutine. 

We demonstrate that the class of constraint languages solvable by peek arc consistency is a considerable 
extension of that which can be solved by arc consistency, and in particular contains many infinite-domain 
constraint languages. Examples are the constraint satisfaction problem for the point algebra in temporal 
reasoning 1 15 1, and tractable set constraints |8 1. But also, several finite-domain constraint languages where 
previously the "best" known algorithm was the path-consistency procedure can be solved by our peek arc 
consistency procedure. For example, this is the case for homomorphism problems to unbalanced orienta- 
tions of cycles |9 |. Other examples that can be solved by peek arc consistency but not by arc consistency 
are 2-SAT, and many other CSPs where the relations are closed under a dual descriminator or a median 
operation. 

Our study of peek arc consistency employs universal algebraic techniques which have recently come 
into focus in the complexity of constraint satisfaction. In addition to obtaining results showing that lan- 
guages are tractable by this algorithm, we develop an algebraic characterization of the constraint languages 
solvable by the algorithm. The characterization is exact-necessary and sufficient-for all finite and infi- 
nite domain constraint languages. We also exhibit closure properties on the class of constraint languages 
tractable by the algorithm. 

A notable feature of this work is the end to which universal algebraic techniques are applied. Thus far, in 
constraint satisfaction, such techniques have primarily been used to demonstrate complexity class inclusion 
results, such as polynomial-time decidability results, and completeness results, such as NP-completeness 
results. Here, we utilize such techniques to investigate the power of a particular efficient and practical 
algorithm. That is, we differentiate among constraint languages depending on whether or not they are 
solvable via a specific algorithmic method, as opposed to whether or not they are contained in a complexity 
class. To our knowledge, this attitude has only been adopted in a limited number of previous papers that 
studied arc consistency and extensions thereof iH E) . 

2 Preliminaries 

Our definitions and notation are fairly standard. 

Structures. A tuple over a set B is an element of B'^ for a value fc > 1 called the arity of the tuple; when 
t is a tuple, we use the notation t = {ti, . . . ,tk) to denote its entries. A relation over a set i? is a subset 
of B^ for a value fc > 1 called the arity of the relation. A signature cr is a finite set of symbols, each of 
which has an associated arity. We use tt^ to denote the operator that projects onto the ith coordinate: 7ri(t) 
denotes the ith entry ti of a tuple t = {ti, . . . , tk), and for a relation R we define TTi{R) — {TTi{t) \ t € R}. 

A structure B over signature a consists of a universe B, which is a set, and a relation R^ C B'' 
for each symbol R of arity k. (Note that in this paper, we are concerned only with relational structures, 
which we refer to simply as structures.) Throughout, we will use the bold capital letters A, B, ... to denote 
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structures, and the corresponding non-bold capital letters A,B,...to denote their universes. We say that a 
structure B is finite if its universe B has finite size. 

For two structures A and B over the same signature cr, the product structure A x B is defined to be the 
structure with universe Ax B and such that R^^^ {((ai, 61), . . . , (ofc, 5fe)) | a £ R^,b e R^} for all 
R & a. We use A" to denote the n-fold product A x ■ • ■ x A. 

We say that a structure B over signature <t' is an expansion of another structure A over signature cr if 
(1) cr' D cr, (2) the universe of B is equal to the universe of A, and (3) for every symbol i? G cr, it holds 
that R^ = R^. We will use the following non-standard notation. For any structure A (over signature cr) 
and any subset SCA,we define [A, S] to be the structure with the signature a U {U} where [/ is a new 
symbol of arity 1, defined by C/I^'^l = S and = R^ for all R e a. 

For two structures A and B over the same signature cr, we say that A is an induced substructure of B 
if ^4 C _B and for every i? G cr of arity k, it holds that R^ = A'^ Ci R^. Observe that for a structure B and 
a subset B' C B, there is exactly one induced substructure of B with universe B' . 

Homomorphisms and the constraint satisfaction problem. For structures A and B over the same 
signature cr, a homomorphism from A to B is a mapping h : A B such that for every symbol R of a 
and every tuple (ai, . . . , a^,) G R^, it holds that {h{ai), . . . , h{ak)) G R^. We use A ^ B to indicate 
that there is a homomorphism from A to B; when this holds, we also say that A is homomorphic to B. 
The homomorphism relation is transitive, that is, if A ^ B and B ^ C, then A C. 

For any structure B (over cr), the constraint satisfaction problem for B, denoted by CSP(B), is the 
problem of deciding, given as input a finite structure A over cr, whether or not there exists a homomorphism 
from A to B. In discussing a problem of the form CSP(B), we will refer to B as the constraint language. 

There are several equivalent definitions of the constraint satisfaction problem for a constraint language, 
most notably the definition used in artificial intelligence. In logic, the constraint satisfaction problem can 
be formulated as the satisfiability problem for primitive positive formulas in a fixed structure B. Homo- 
morphism problems as defined above have been studied independently from artificial intelligence in graph 
theory, and the connection to constraint satisfaction problems has been observed in lilOi . 

pp-definability. Let cr be a signature; a primitive positive formula over cr is a formula built from atomic 
formulas R{wi, . . . , w„) with R £ a, conjunction, and existential quantification. A relation R C B^ 
is primitive positive definable (pp-definable) in a structure B (over a) if there exists a primitive positive 
formula (f>{vi, . . . ,Vk) with free variables vi, . . . ,Vk such that 

{bi,...,bk) e R^B,bi,...,bk\=(l). 

Automorphisms. An isomorphism between two relational structures A and B over the same signature cr 
is a bijective mapping from Ato B such that t G R^ if and only if /(<) G R^ for all relation symbols R 
in cr. An automorphism of A is an isomorphism between A and A. An orbit of A is an equivalence class 
of the equivalence relation = that is defined on Ahy x = y iff a{x) — y for some automorphism a of A. 

Polymorphisms. When/ : _B" _B is an operation on B and ti = {tu, . . . ,tik), .. .,tn — {tni, ■ ■ ■ ,tnk) G 
B'' are tuples of the same arity k over B, we use /(ti, . . . , i„) to denote the arity k tuple obtained by ap- 
plying / coordinatewise, that is, /(ZT, . . . ,Q = (/(in, . . . , i„i), . . . , /{hk, t„fe)). An operation 
/ : i?" — > i? is a polymorphism of a structure B over cr if for every symbol R £ a and any tuples 
ti, . . . ,tn G R^, it holds that f{ti, . . . , tn) G R^. That is, each relation R^ is closed under the action of 
/. Equivalently, an operation / : B" ^ B is a polymorphism of B if it is a homomorphism from B" to 
B. Note that every automorphism is a unary polymorphism. 

Categoricity. Several of our examples for constraint languages over infinite domains will have the fol- 
lowing property that is of central importance in model theory. A countable structure is ui-categorical if 
all countable models of its first-order theorjQ are isomorphic. By the Theorem of Ryll-Nardzewski (see 

^ The first-order theory of a structure is the set of first-order sentences that is true in the structure. 
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e.g. lfT2ll ) this is equivalent to the property that for each n there is a finite number of inequivalent first-order 
formulas over F with n free variables. A well-known example of an tj-categorical structure is (Q, <); for 
many more examples of cj-categorical structures and their application to formulate well-known constraint 
satisfaction problems, see 12 ■ 

3 Arc Consistency 

In this section, we introduce the notion of arc consistency that we will use, and review some related notions 
and results. The definitions we give apply to structures with relations of any arity, and not just binary rela- 
tions. The notion of arc consistency studied here is sometimes called hyperarc consistency. Our discussion 
is based on the paper Q. 

For a set B, let p{B) denote the power set of B. For a structure B (over a), we define p(B) to 
be the structure with universe p{B) \ {0} and where, for every symbol i? e cr of arity k, R*^^^^ = 
{{TriS,...,TrkS)\SCR^,S^9}. 

Definition 1 An instance A of CSP{R) has the arc consistency condition (ACC) if there exists a homo- 
morphismfrom A to p(B). 

Definition 2 We say that arc consistency (AC) decides C5P(B) if for all finite structures A, the following 
holds: (A, B) has the ACC implies that A B. 

Note that the converse of the condition given in this definition always holds. By a singleton, we mean 
a set containing exactly one element. 

Proposition 3 For any structures A and B, if h is a homomorphism from A to B, then the mapping that 
takes a to the singleton {/i(a)} is a homomorphism from A to p(B). 

Hence, when AC decides CSP(B), an instance A of CSP(B) is a "yes" instance if and only if A has 
the ACC with respect to B. That is, deciding whether an instance A is a "yes" instance can be done 
just by checking the ACC. It was observed in |[Tol that, for any finite structure B, there is an algebraic 
characterization of AC: AC decides CSP(B) if and only if there is a homomorphism from p(B) to B. 

It is well-known that for a finite structure B, whether or not instances A of CSP(B) have the ACC 
can be checked in polynomial-time. The algorithm for this is called the arc consistency procedure, and 
it can be implemented in linear time and linear space in the size of A; note that we consider B to be 
fixed. The same holds for many infinite-domain constraint languages, for example for all cj-categorical 
constraint languages. Since this is less well-known, and requires a slightly less standard formulation of 
the arc consistency procedure, we present a formal description of the algorithm that we use; this algorithm 
can be applied for any (finite- and infinite-domain) constraint language that has finitely many pp-definable 
unary relations in B. 

We assume that B contains a relation for each unary primitive positive definable relation in B. This is 
not a strong assumption, since we might always study the expansion B' of B by all such unary relations. 
Then, if we are given an instance A of CSP(B), we might run the algorithm for CSP(B') on the expansion 
A' of A that has the same signature as B' and where the new unary relations are interpreted by empty 
relations. It is clear that a mapping from A to £? is a homomorphism from A to B if and only if it is a 
homomorphism from A' to B'. 

To conveniently formulate the algorithm, we write R^{x) for the relation symbol of the relation that 
is defined by a pp-formula (t>{x) in B. We write QaHo-I: ■ • • , a;}) for the conjunction over all formulas 
of the form S{ai) where 5 is a unary relation symbol such that a-i £ S^. The pseudo-code of the arc 
consistency procedure can be found in Figure [T] 

The space requirements of the given arc consistency procedure are clearly linear. It is also well-known 
and easy to see that the procedure can be implemented such that its running time is linear in the size of the 
input. 

Proposition 4 Let ^ be a structure with finitely many primitive-positive definable unary relations. Then a 
given instance A ofCSP(B) has the ACC if and only if the arc consistency procedure presented in Figure\l] 
does not reject. 
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Arc Consistency B (A) 






Input: a finite relational structure A. 






Do 






For every relation symbol R, every tuple (ai , 




and every i S {1, . . . , Z} 


Let (j) be the formula 3ai, . . . , ai_i, a^+i, 


... , ai.R{ai, . 


. .,ai) ^ Qxiiai, . . . , ai]) 


If (p defines the empty unary relation over 


B then reject 




Else, add ai to i?^ 






Loop until no relation in A is changed 







Figure 1: The arc consistency procedure for CSP(B), where B contains all primitive positive definable 
unary relations in B. 

In particular, we can apply the algorithm shown in Figure [T] to all constraint satisfaction problems 
with an w-categorical constraint language. However, it was shown that in this case the algorithm cannot 
be used as a decision procedure for CSP(B) (i.e., that rejects an instance A if and only if it does not 
homomorphically map to B), unless B is homomorphically equivalent to a finite structure [4|. 

4 Peek Arc Consistency 

We present basic definitions and results concerning peek arc consistency. The following two definitions are 
analogous to Definitions [T] and |2]of the previous section. 

Definition 5 An instance (A, B) of the CSP has the peek arc consistency condition (PACC) if for every 
element a G A there exists a homomorphism hfrom A to p(B) such that h(a) is a singleton. 

Definition 6 We say that peek arc consistency (PAC) decides C5f (B) if for all finite structures A, the 
following holds: (A, B) has the PACC implies that A — > B. 

The converse of the condition given in this definition always holds. Suppose that A ^ B; then, the 
mapping taking each a G A to the singleton {h{a)} is a homomorphism from A to p(B) (Proposition[3]l, 
and hence (A, B) has the PACC. 

We now present an algorithm that decides for a given instance A of CSP(B), whether (A, B) has the 
PACC. We assume that B has a finite number of orbits and pp-definable binary relations. This holds in 
particular for all cj-categorical structures. The following lemma then allows us to use the arc consistency 
procedure presented in Figure [1] for every expansion of B by singletons. 

Lemma 7 Let 'B be a structure with finitely many pp-definable binary relations. Then every expansion of 
B /jy fl constant has finitely many pp-definable unary relations. 

Proof. Suppose for contradiction that for a constant &, there are infinitely many pairwise distinct unary 
relations with a pp-definition in the expansion of B with {h}. For each such definition, if we replace the 
occurrences of the relation symbol for the singleton {6} by a new variable, we obtain formulas that are 
pp-definitions in B of pairwise distinct binary relations. □ 

Proposition 8 Let B be a structure with finitely many orbits and finitely many pp-definable binary rela- 
tions. Then a given instance A o/C5f (B) has the PACC if and only if the algorithm presented in Figure^ 
does not reject A. 

Proof. Suppose that A is an instance of CSP(B) that has the PACC. We have to show that for any element 
a from A there exists an orbit O of B such that for any choice of 6 G O the arc consistency procedure that is 
called in the inner loop of the algorithm in Figure|2]does not reject the instance [A, {a}] of CSP([B, {6}]). 
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Peeks (A) 

Input: a finite relational structure A. 

For every element a of A 
For all b e {61, ... ,6;} 

Run the arc-consistency procedure on the instance [A, {a}] of CSP([B, {b}]) 
If the arc consistency procedure rejects for pp-definable sets, reject 



Figure 2: The peek arc consistency procedure for CSP(B) for structures B that have finitely many orbits 
and pp-definable binary relations. Let 61, . . . , 6; be arbitrary representatives from the orbits in B, i.e., we 
assume that there are / orbits and 61 , . . . , 6; are in pairwise distinct orbits. 

Because A has the PACC, there exists a homomorphism from A to p(B) such that h{a) is a singleton 
{c}. Let O be the orbit of c, and let b be the element from O that is used by the algorithm. We know 
that there exists an automorphism a that maps c to b. Clearly, the mapping h' defined by a; 1-^ a{h{x)) 
is a homomorphism from A to p(B) such that h'{a) = {b} is a singleton. By Lemma [T] the structure 
[B, {b}] has finitely many pp-definable unary relations. Proposition]?] then shows that the arc consistency 
procedure does not reject the instance [A, {a}] ofCSP([B, {b}]). All the implications in this argument can 
be reversed, which shows the statement of the proposition. □ 

Theorem 9 Let ^ be a structure with finitely many orbits and finitely many pp-definable binary relations, 
and suppose that PAC solves CSP(B). Then there exists a quadratic-time and linear-space algorithm that 
decides CSP(B). Moreover, CSP(B) can be decided in linear time with a linear number of processors. 

Proof. Let A be an instance of CSP(B). If the algorithm in Figure ]2]rejects A, then it does not have the 
PACC and hence A does not homomorphically map to B. If the algorithm in Figure]2]does not rejects A, 
then A has the PACC. By assumption, PAC solves CSP(B), and therefore there exists a homomorphism 
from A to B. 

Because the arc consistency procedure uses linear space, the algorithm in Figure]2]can be implemented 
in linear space as well. The arc consistency procedure is called a linear number of times (recall that B is 
fixed and not part of the input). Because the arc consistency procedure can be implemented such that it 
uses linear time, the overall running time on a sequential machine is quadratic in the worst case. However, 
note that each application of the arc consistency procedure can be performed on a different processor. □ 

5 Algebraic Characterization 

In this section we present a general algebraic characterization of those constraint languages where PAC 
decides CSP(B) for an arbitrary finite or infinite structure B. 

We use the notation Ind(p(B)") to denote the induced substructure of p(B)" whose universe contains 
an n-tuple of p(B)" if and only if at least one coordinate of the tuple is a singleton. 

Theorem 10 Let ^ be a structure. PAC decides CSP(B) if and only if for all n there is a homomorphism 
from all finite substructures o/Ind(p(B)") to B. 

Proof. (^): Suppose that (A, B) has the PACC. Then, by definition of the PACC, for all a £ A, there is a 
homomorphism ha from [A, a] to [p(B), {{b} \ b G B}]. Let n = |j4|. Now consider the homomorphism 
h from A to p(B)" defined by h{x) = IlaeAha{x). Notice that for every a G A, the element ha{a) 
of the tuple h{a) is a singleton, and hence h is in fact a homomorphism from A to Ind(p(B)"). Let C 
be the structure that is induced by the image of h in Ind(p(B)"). Since C is finite, it is by assumption 
homomorphic to B, and by composing homomorphisms we obtain that there is a homomorphism from A 
to B. 
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Let n > 1, and let C be a finite substructure of Ind(p(B)"). We have to show that C, viewed as 
an instance of CSP(B), has the PACC, which suffices by assumption. Let a be any element of the universe 
of C. By definition of C, we have that a is an n-tuple such that some coordinate, say the ith coordinate, is 
a singleton. The projection function tt^ is a homomorphism from [C, {a}] to [p(B), {{6} | b G B}]. □ 

6 Robustness 

In this section, we demonstrate that the class of structures B such that PAC decides CSP(B) is robust in 
that it satisfies certain closure properties. 

We first investigate expansion by pp-definable relations. Say that a structure B' (over a') is a pp- 
expansion of B (over cr) if B' is an expansion of B and for every symbol R ^ a' \a,\t holds that i?^ is 
pp-definable over B. 

Theorem 11 Suppose that PAC decides C5P(B). Then for any pp-expansion B' o/B, it holds that PAC 
decides CSP{B'). 

Proof. It suffices to show that the theorem holds for an expansion of B by (1) an intersection of existing 
relations, (2) a projection of an existing relation, (3) a product of an existing relation with B, or (4) the 
equality relation. In each of these cases, we will consider an expansion B' of B where the signature of B' 
has an additional symbol T. We will use cr to denote the signature of B, and so the signature of B' will be 
aU{T}. 

Bv TheoremfTOl it suffices to show that for every ti > 1 and for all finite substructures C of Ind(p(B')") 
there exists a homomorphism h from C to B'. Let C be a finite subset of the universe of Ind(p(B)"), let 
C be the induced substructure of Ind(p(B)") with universe C, and let C be the induced substructure of 
Ind(p(B')") with universe C. By TheoremfTOl there is a homomorphism h from C to B. Since B' is an 
expansion of B with just one additional symbol T, it suffices to show that h{T'-^ ) C . 

(1) : Suppose that T^' ^ R^nS^ for R,S ea. It follows that T^^^'' C p ^.^(b)^ ^^.^^^ ^j^j^j^ 
we obtain T*^' C i?*^ n S"^. For any tuple t e we thus have h(t) e i?^ n S^, and hence hf^) e T^' . 

(2) : Suppose that is the relation 

{{ti,...,tk) I 3tk+i,...,tk+i so that {ti,...,tk+i) ei?^} 

that is, is the projection of R^ onto the first k coordinates; we denote the arity of i? G a by fc + /. (We 
assume that the projection is onto an initial segment {1, . . . , /c} of coordinates for the sake of notation; a 
similar argument holds for an arbitrary set of coordinates.) We have that T*^'^ ' is the projection of R^'^^^ 
onto the first k coordinates. Thus, for any tuple t G T*-' , we have that h{t) is the projection of a tuple in 
i?^,and is hence inT^'. 

(3) : Suppose that T^' = R^ x B for i? G ct. Let < = (ti, . . . , tk+i) be any tuple in T*^'. We have 
that {ti, . .. ,tk) G i?*^, and hence h{ti, . ..,tk) G R^. Since we have h{tk+i) G B, it follows that 
/i(ti,...,tfc+i) gt^'. 

(4) : Suppose that T^' = {(6, 6) | G B}. For any tuple (ti, e TP^^'', we have ti ta- For any 
tuple {ti,t2) G we thus also have ti = t2, and we have h{ti,t2) G T^'. □ 

We now consider homomorphic equivalence. 

Theorem 12 Let 'B be a structure. Suppose that PAC decides CSP(B) and that B' is a structure that is 
homomorphically equivalent to B, that is, B B' andU' B. Then PAC decides CSP(B'). 

We first establish the following lemma. 

Lemma 13 Let f be a homomorphism from B' to B. The map /' defined on p{B') \ {0} by f'(U) — 
{/(m) \ u £ U} is a homomorphism from p(B') to p(B). 

Proof. Let i? be a symbol, and let t be a tuple in R«'^^'\ We have t = {niS, . . . ,7rfc5') where S C R^ and 
5 7^ 0. Define S' = {/(s) | s G 5}. We have S' C R^' . As f{t) = (ttiS", . . . , tt^S"), the conclusion 
follows. □ 
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Proof. (Theorem[T2]) Suppose that (A, B') has the PACC. We want to show that A B'. 

We first show that (A, B) has the PACC. Let a be an element of A. There exists a homomorphism h 
from A to p(B') such that h{a) is a singleton. The mapping /' given by Lemma [T3] is a homomorphism 
from p(B') to p(B) that maps singletons to singletons. Hence, the map a — > f'{h{a)) is a homomorphism 
from A to p(B) mapping a to a singleton. We thus have that (A, B) has the PACC. 

Since PAC decides CSP(B), there is a homomorphism from A to B. By hypothesis, there is a homo- 
morphism from B to B', and so we obtain that A is homomorphic to B. □ 

7 Tractability by PAC 

Slice-semilattice operations. We first study a class of ternary operations. Recall that a semilattice 
operation is a binary operation that is associative, commutative, and idempotent, and that a semilat- 
tice operation © is well-defined on finite sets, that is, for a finite set 5 = {si, . . . , s„} we may define 
®{S) = ®(©(. . . © (©(si, S2), S3), . . .), s„). We say that a ternary operation t : B'^ ^ B is 3. slice- 
semilattice operation if for every element b G B, the binary operation ©f, defined by ©b(x, y) = t{x, y, b) 
is a semilattice operation. These ternary operations have been studied in [5 1; there, the following examples 
were presented. 

Example 14 Let B be a set, and let d : B"^ B be the operation such that d{x, y, z) is equal to x if x = y, 
and z otherwise. This operation is known as the dual discriminator on B, and is an example of a slice- 
semilattice operation. For examples of constraint languages that have a dual discriminator polymorphism, 
see e.g. [14^ . 

Example 15 Let B be a subset of the rational numbers, and let median : B'^ B be the ternary oper- 
ation on B that returns the median of its arguments. (Precisely, given three arguments Xi, X2, and X3 in 
ascending order so that Xi < X2 < x^, the median operation returns X2.) This operation is an example of 
a slice-semilattice operation. 

Theorem 16 Let ^ be a finite structure that has a slice-semilattice polymorphism. Then, the problem 
CSPiB) is tractable by PAC. 

Proof. Let / denote the slice-semilattice polymorphism. By Theorem[TO] it suffices to show that for every 
finite substructure C of Ind(p(B)") there is a homomorphism h from C to B. 

For each element {Si, . . . , Sn) in C we define h{Si, . . . , S„) as follows. Let g be the maximum 
index such that Sg is a singleton; we are guaranteed the existence of such an index by the definition of 
Ind(p(B)"). We define a sequence of values 6g, . . . , 6„ e B inductively. Set bg to be the value such that 
{bg} = Sg. For i with g < i < n, define bi — (Bbt^iSi. We define h{Si, . . . , Sn) — bn- 

We now show that h is in fact a homomorphism from C to B. Let R be any symbol of arity k. Suppose 
that {{S\, ...,Sl),..., (S'^ . . . , S^)) e i?ind(p(B)")^ jjgfljjg ^ sequence of tuples FT, . • • , <^ e i?^ in 
the following way. Let ii be any tuple such that £ (5^ x ■ • • x S'^)ni?^. Foriwithl <i< n,we define 
t~ = (©t(,_i)i5'j^ . . . , ©t,,_l)^^'S'f ). Given that U-i is in R^, we prove that 77 is in R^ . Let d C be a 

set of tuples such that (tti (Ci), . . . , 7rfe(Ci)) = {S}, . . . , S'f ). Let c^, . . . , c™ with m > 2 be a sequence of 
tuples such that {c^,...,d^] ^ C,. W^have U = /(^, • • • /(^, f ((? ,1^ ~i) . . . ,~). Since 

/ is a polymorphism of R^, we obtain ti £ R^. 

Observe now that for each tuple (5f , . . . , S"^), the values bg, . . . ,bn that were computed to determine 
h{Sl, . . . , S*^) = bn have the property that for each i with g < i < n, bi = tij. It follows that h is the 
desired homomorphism. □ 

It is well-known that the problem 2-SAT can be identified with the problem CSP(B) for the structure 
B with universe B ~ {0,1} and relations 

i?fo„)={0,lp\{(0,0)} 
i?foi)={0,lp\{(0,l)} 
i?fii)={0,ir\{(l,l)} 
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It is known, and straightforward to verify, that the dual discriminator operation on {0, 1} is a polymorphism 
of this structure B. We therefore obtain the following. 

Theorem 17 The problem 2-SAT is tractable by PAC. 

Let (7 be the signature {E} where £^ is a symbol having arity 2. We call a structure G over a an 
undirected bipartite graph if is a symmetric relation, the universe G of G is finite, and G can be 
viewed as the disjoint union of two sets Vb and Vi such that E^ C (Vo x Vi) U (V^i x Vb). 

Theorem 18 Let G be an undirected bipartite graph. The problem CSP{G) is tractable by PAC. 

Proof. Let G' be the bipartite graph with universe {0, 1} and where E^' = {(0, 1), (1,0)}. As E'^' 
is pp-definable over the structure B corresponding to 2-SAT above, by 0(wi,W2) = ^(o,i)(''^i, ''^2) A 
R{o,i){v2,vi), we have that PAC decides CSP(G') by TheoremfTT] 

If £'*^ is empty, the claim is trivial, so assume that [s.s') e i?*^. We claim that G and G' are 
homomorphically equivalent, which suffices by Theorem [12] The map taking — ^ s and 1 ^ s' is a 
homomorphism from G' to G. The map taking all elements in Vq to and all elements in Vi to 1 is a 
homomorphism from G to G'. □ 

We call a finite structure D over signature {A] where A is a binary relation symbol an orientation of 
a cycle if D can be enumerated as di, . . . . d„ such that contains either (di, c?i+i) or (di+i, di) for all 
1 < i < n, contains either ((i„, di) or [dn, di), and contains no other pairs. The orientation of a cycle is 
called unbalanced if the number of elements AP of the form {di, di+i) or ((i„, di) is distinct from n/2. It 
has been shown in |l9l that for every unbalanced orientation of a cycle D there is a linear order on D such 
that D is preserved by the median operation with respect to this linear order. 

We therefore have the following result. 

Theorem 19 Let D be an unbalanced orientation of a cycle. Then CSPiT)) is tractable via PAC. 

The Point Algebra in Temporal Reasoning. The structure (Q, <, is known as the point algebra in 
temporal reasoning. The problem CSP(Q, <, ^) can be solved by the path-consistency procedure 1161 . 

Theorem 20 C5P(Q, <, /) is tractable via PAC. 

Proof. Clearly, the structure (Q; <, ^) has only one orbit. It is well-known that it is also w-categorical lfT2l . 
and therefore has in particular a finite number of pp-definable binary relations. To apply Theorem|9] we 
only have to verify that PAC decides CSP(Q; <, 7^). 

Let A be an instance of CSP(Q; <, We claim that if there is a sequence ai, . . . , £ A such that 
(fli, fli+i) e <^ for all 1 < i < fc, (afc, ax) G <'^, and (oj,, a^) e for some p,q ^ {!,..., k}, then 
there is no homomorphism from A to p(B) such that h{ai) is a singleton {bi}. Suppose otherwise that 
there is such a homomorphism h. By the definition of p(B) there must be a sequence bi, . . . ,bk such that 
bi £ h{ai) for all 1 < i < fc and {bi, bi+i) G <^ for all 1 < i < n. Moreover, {bk, 61) £ and hence 
bi — ■ ■ ■ — bk- But then we have {h{ap), h{aq)) = (61, 61) G a contradiction. Hence, the structure A 
does not have the PACC if A has such a sequence ai , . . . , a^. It is known lfT6l that if A does not contain 
such a sequence, then A (Q; <, ^). This shows that PAC decides CSP(Q; <, □ 

Set Constraints. Reasoning about sets is one of the most fundamental reasoning tasks. A tractable set 
constraint language has been introduced in |8|. The constraint relations in this language are containment 
X C Y ('every element of X is contained in F'), disjointness X\\Y {'X and Y do not have common 
elements'), and disequality X 7^ F ('X and Y are distinct'). In the CSP for this constraint language we 
are given a set of constraints and a set of containment, disjointness, and disequality constraints between 
variables, and we want to know whether it is possible to assign sets (we can without loss of generality 
assume that we are looking for subsets of the natural numbers; note that we allow the empty set) to these 
variables such that all the given constraints are satisfied. It was shown in |3| that this problem can be 
modeled as CSP((D; C, ||, 7^)), where D C 2^ is a countably infinite set of subsets of N, and such that 
{D;C,\\,y^) is w-categorical and has just two orbits (the orbit for 0, and the orbit for all other points). 



9 



Theorem 21 CSP{{D; C, 1 1, ^)) is tractable via PAC. 

Proof. Because {D; C, \ is tj-categorical, it suffices as in the proof of Theorem l20l to verify that PAC 
decides CSP((D; C, 1 1, ^)) in order to apply Theorem|9] 

Let A be an instance of CSP((-D; C, ||, ^)). We claim that if there are four sequences (a}, . . . , a^^), 
. . . , {af, . . . , a|^) of elements from A such that 

• «fei = Ofc, = a? = af , 

• (a- , a-+i) e for all 1 < j < 4, 1 < i < 

• ia\,al) e^^, and 

• e 11^. 

then there is no homomorphism h from A to p(B) such that h{a\) is a singleton {&}}. Suppose otherwise 
that there is such a homomorphism h. By the definition of p(B) there must be sequences of elements 
(61,..., 61 J,..., (61,..., 6^) such that 

• bl e h{al) for all 1 < j < 4, 1 < i < kj, 

.1,1 _ !,2 _ v3 _ ui 

• "fei — % — Ol — Ol' 

• iH^K+i) e for all 1 < j < 4, 1 < i < fc^, 

• (6}, 62) e^B^^jjjj 

The third item and the fourth item together imply that b\_^ ~ ^ (™y set that contains two distinct 

sets cannot be empty). The third item and the fifth item together imply that 6^^ = 6^,^ = (any set that is 
contained in two disjoint subsets must be the empty set), a contradiction. 

It follows from Lemma 3.7. in [8| that if A does not contain such sequences, then A {D; C, 1 1, ^). 
This shows that PAC decides CSP(i:'; C, 1 1, ^). □ 

PAC tractability results can also be shown for the basic binary relations in the spatial reasoning formal- 
ism of RCC-5 1 13 1, which is closely related to set constraints, but also for other known tractable spatial 
constraint satisfaction problems in qualitative spatial reasoning, e.g., in ||6l. 
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Abstract 

This paper studies peek arc consistency, a reasoning technique that extends the well- 
known arc consistency technique for constraint satisfaction. In contrast to other more costly 
extensions of arc consistency that have been studied in the literature, peek arc consistency 
requires only linear space and quadratic time and can be parallelized in a straightforward 
way such that it runs in linear time with a linear number of processors. We demonstrate 
that for various constraint languages, peek arc consistency gives a polynomial-time decision 
procedure for the constraint satisfaction problem. We also present an algebraic characteriza- 
tion of those constraint languages that can be solved by peek arc consistency, and study the 
robustness of the algorithm. 

1 Introduction 

Background. A basic knowledge reasoning task that has been studied in many incarnations is to decide 
the satisfiability of given relationships on variables, where, for instance, variables may represent objects 
such as temporal events or spatial regions, and relationships may express precedence, containment, overlap, 
disjointness, and so forth. Instances of this reasoning task can typically be modeled using the constraint 
satisfaction problem (CSP), a computational problem in which the input consists of a set of constraints 
on variables, and the question is whether or not there is an assigimient to the variables satisfying all of 
the constraints. While the CSP is in general NP-hard, researchers have, in numerous settings, aimed to 
identify restricted sets of relationships under which the CSP is polynomial-time decidable; we refer to sets 
of relationships as constraint languages. 

Arc consistency is an algorithmic technique for constraint satisfaction that has been heavily studied 
and for which highly efficient implementations that are linear in both time and space are known. Arc 
consistency provides a one-sided satisfiability check. It may detect an inconsistency, which always implies 
that the input instance is unsatisfiable. While the converse does not hold in general, it has been shown 
to hold for some particular constraint languages, that is, arc consistency provides a decision procedure 
for satisfiability for these languages. Examples include the language of boolean Horn clauses; various 
graph homomorphism problems, for example, homomorphisms to orientations of finite paths [11]; and all 
constraint languages where satisfiability is first-order definable [1]. 

Curiously, arc consistency typically cannot be used as a decision procedure for infinite -domain con- 
straint languages, by which we mean constraint languages under which variables can take on infinitely 
many values. In many cases, a reason for this is that arc consistency performs inference by considering 
unary (arity 1) projections of relations, and all such projections are already equal to the full domain of the 
language. As an example, consider the binary relations < and ^ interpreted over the domain of rational 
numbers Q. For each of these relations, both of the two possible unary projections are equal to Q, and arc 
consistency in fact will not perform any inference. 

Path consistency is a more powerful algorithmic technique that provides a polynomial-time decision 
procedure for further finite and infinite domain constraint languages. However, the greater power comes 
at the price of worse time and space complexity: the best known implementations require cubic space and 
quadratic time. Unfourtunately, this makes the path consistency procedure prohibitive for many applica- 
tions where one has to deal with large instances of the constraint satisfaction problem. 
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Peek arc consistency. In this paper, we study a general algorithmic technique for constraint satisfaction 

that we call peek arc consistency. Here, we describe the idea of the algorithm for finite-domain constraint 
satisfaction, although, as we show in the paper, this algorithm can be effectively applied to infinite-domain 
constraint satisfaction as well. The algorithm performs the following. For each variable-value pair (x, a), 
the variable x is set to the value a, and then the arc consistency procedure is run on the resulting instance 
of the CSP. If there is a variable x such that for all values a the arc consistency procedure detects an 
inconsistency on (x, o), then the algorithm reports an inconsistency. As with arc consistency, this algo- 
rithm provides a one-sided satisfiability check. One might concieve of this algorithm as being a step more 
sophisticated than arc consistency; it invokes arc consistency as it takes a "peek" at each variable. 

Peek arc consistency has many practical and theoretical selling points. Arc consistency can be im- 
plemented in linear space, for any fixed finite-domain constraint language and many infinite-domain con- 
straint languages; the same holds for peek arc consistency. The time complexity of peek arc consistency is 
quadratic in the input size, which is still much better than the path-consistency algorithm, where the best 
known implementations have a running time that is cubic in the input size. Moreover, peek arc consistency 
can be parallelized in a straightforward way: for each variable-value pair, the arc consistency procedure 
can be performed on a different processor. Hence, with a linear number of processors, we achieve a linear 
running time, for a fixed constraint language. We would also like to remark that implementing peek arc 
consistency is straightforward if one has access to an implementation of arc consistency as a subroutine. 

We demonstrate that the class of constraint languages solvable by peek arc consistency is a considerable 
extension of that which can be solved by arc consistency, and in particular contains many infinite-domain 
constraint languages. Examples are the constraint satisfaction problem for the point algebra in temporal 
reasoning [15], and tractable set constraints [8]. But also, several finite-domain constraint languages where 
previously the "best" known algorithm was the path-consistency procedure can be solved by our peek arc 
consistency procedure. For example, this is the case for homomorphism problems to unbalanced orienta- 
tions of cycles [9]. Other examples that can be solved by peek arc consistency but not by arc consistency 
are 2-SAT, and many other CSPs where the relations are closed under a dual descriminator or a median 
operation. 

Our study of peek arc consistency employs universal algebraic techniques which have recently come 
into focus in the complexity of constraint satisfaction. In addition to obtaining results showing that lan- 
guages are tractable by this algorithm, we develop an algebraic characterization of the constraint languages 
solvable by the algorithm. The characterization is exact-necessary and sufficient-for all finite and infi- 
nite domain constraint languages. We also exhibit closure properties on the class of constraint languages 
tractable by the algorithm. 

A notable feature of this work is the end to which universal algebraic techniques are applied. Thus far, in 
constraint satisfaction, such techniques have primarily been used to demonstrate complexity class inclusion 
results, such as polynomial-time decidability results, and completeness results, such as NP-completeness 
results. Here, we utilize such techniques to investigate the power of a particular efficient and practical 
algorithm. That is, we differentiate among constraint languages depending on whether or not they are 
solvable via a specific algorithmic method, as opposed to whether or not they are contained in a complexity 
class. To our knowledge, this attitude has only been adopted in a limited number of previous papers that 
studied arc consistency and extensions thereof [7, 5]. 

2 Preliminaries 

Our definitions and notation are fairly standard. 

Structures. A tuple over a set B is an element of B'' for a value fc > 1 called the arity of the tuple; when 
t is a tuple, we use the notation t = {ti, . . . ,tk) to denote its entries. A relation over a set i? is a subset 
of B'' for a value k > 1 called the arity of the relation. A signature cr is a finite set of symbols, each of 
which has an associated arity. We use tt^ to denote the operator that projects onto the ith coordinate: tt^ (t) 
denotes the ilh entry of atuplei = (fi, . . . ,tk), and for a relation i? we define 77^(7?) = {7ri(i) | t e i?}. 

A structure B over signature a consists of a universe B, which is a set, and a relation C B^ 
for each symbol R of arity k. (Note that in this paper, we are concerned only with relational structures, 



2 



which we refer to simply as structures.) Throughout, we will use the bold capital letters A, B, ... to denote 
structures, and the corresponding non-bold capital letters A,B,...to denote their universes. We say that a 
structure B is finite if its universe B has finite size. 

For two structures A and B over the same signature a, the product structure A x B is defined to be the 
structure with universe A x B and such that R^^^ = {((ai, 61), . . . , (a/j, bk)) \a G R^,b G i?^} for all 
R G a. We use A" to denote the n-fold product A x • • • x A. 

We say that a structure B over signature a' is an expansion of another structure A over signature a if 
(1) ct' 3 cr, (2) the universe of B is equal to the universe of A, and (3) for every symbol R <= a, it holds 
that R^ = R^. We will use the following non-standard notation. For any structure A (over signature a) 
and any subset SCA,-we define [A, S] to be the structure with the signature aU {U} where U isa new 
symbol of arity 1, defined by t/I^ '^'l = S and -^l = R^ for all R e a. 

For two structures A and B over the same signature a, we say that A is an induced substructure of B 
if AC B and for every e cr of arity k, it holds that R^ = A'' D R^. Observe that for a structure B and 
a subset B' C B, there is exactly one induced substructure of B with universe B'. 

Homomorphisms and the constraint satisfaction problem. For structures A and B over the same 

signature cr, a homomorphism from A to B is a mapping h : A ^ B such that for every symbol ii of cr 
and every tuple (oi, . . . , ak) S R^, it holds that {h{a{), . . . , h{ak)) G R^. We use A ^ B to indicate 
that there is a homomorphism from A to B; when this holds, we also say that A is homomorphic to B. 
The homomorphism relation is transitive, that is, if A ^ B and B ^ C, then A ^ C. 

For any structure B (over cr), the constraint satisfaction problem for B, denoted by CSP(B), is the 
problem of deciding, given as input a finite structure A over cr, whether or not there exists a homomorphism 
from A to B. In discussing a problem of the form CSP(B), we will refer to B as the constraint language. 

There are several equivalent definitions of the constraint satisfaction problem for a constraint language, 
most notably the definition used in artificial intelligence. In logic, the constraint satisfaction problem can 
be formulated as the satisfiability problem for primitive positive formulas in a fixed structure B. Homo- 
morphism problems as defined above have been studied independently from artificial intelligence in graph 
theory, and the connection to constraint satisfaction problems has been observed in [10]. 

pp-definability. Let cr be a signature; a primitive positive formula over cr is a formula built from atomic 
formulas R{w\, . . . , w„) with R G a, conjunction, and existential quantification. A relation R C B'^ 
is primitive positive definable (pp-definable) in a structure B (over cr) if there exists a primitive positive 
formula ^{vi ,Vk) with free variables vi,. . . ,Vk such that 

(61, ...,bk) &R<^B,bi,...,bk\=(f>. 

Automorphisms. An isomorphism between two relational structures A and B over the same signature a 

is a bijective mapping from Ato B such that t G R,^ if and only if f{t) G R^ for all relation symbols R 
in a. An automorphism of A is an isomorphism between A and A. An orbit of A is an equivalence class 
of the equivalence relation = that is defined on ^ by a; = y iff a{x) = y for some automorphism a of A. 

Polymorphisms. When/ : B" — > i? is an operation on i? and = {tu, . . . ,tik), . . . ,tn = {tni, ■ ■ ■ ,t„k) € 
B'' are tuples of the same arity k over B, we use /(ti, . . . , i„) to denote the arity k tuple obtained by ap- 
plying / coordinatewise, that is, f{t^, . . . ,Q = (/(in, . . . , . . . , /(tife, • • ■ , infe))- An operation 
/ : ^ i? is a polymorphism of a structure B over cr if for every symbol R G a and any tuples 
h,. . . G R^, it holds that f(tl, . ■ .,t^) € R^. That is, each relation R^ is closed under the action of 
/. Equivalently, an operation / : B" —> B is a polymorphism of B if it is a homomorphism from B" to 
B. Note that every automorphism is a unary polymorphism. 

Categoricity. Several of our examples for constraint languages over infinite domains will have the fol- 
lowing property that is of central importance in model theory. A countable structure is oj-categorical if 
all countable models of its first-order theory' are isomorphic. By the Theorem of Ryll-Nardzewski (see 

^The first-order theory of a structure is the set of first-order sentences that is true in the structure. 
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e.g. [12]) this is equivalent to the property that for each n there is a finite number of inequivalent first-order 

formulas over F with n free variables. A well-known example of an tj-categorical structure is (Q. <); for 
many more examples of w-categorical structures and their application to formulate well-known constraint 
satisfaction problems, see [2]. 

3 Arc Consistency 

In this section, we introduce the notion of arc consistency that we will use, and review some related notions 
and results. The definitions we give apply to structures with relations of any arity, and not just binary rela- 
tions. The notion of arc consistency studied here is sometimes called hyperarc consistency. Our discussion 
is based on the paper [7]. 

For a set B, let p{B) denote the power set of B. For a structure B (over a), we define p(B) to 
be the structure with universe p{B) \ {0} and where, for every symbol i? G a of arity k, R*^^^^ = 

{{TT,S,...,7TkS)\SCR^,Sj^<D}. 

Definition 1 An instance A of CSP(B) has the arc consistency condition (ACC) if there exists a homo- 
morphismfrom A to p(B). 

Definition 2 We say that arc consistency (AC) decides C>S'P(B) if for all finite structures A, the following 
holds: (A, B) has the ACC implies that A — > B. 

Note that the converse of the condition given in this definition always holds. By a singleton, we mean 
a set containing exactly one element. 

Proposition 3 For any structures A and B, if h is a homomorphism from A to B, then the mapping that 
takes a to the singleton {h{a)} is a homomorphism from A to p(B). 

Hence, when AC decides CSP(B), an instance A of CSP(B) is a "yes" instance if and only if A has 
the ACC with respect to B. That is, deciding whether an instance A is a "yes" instance can be done 
just by checking the ACC. It was observed in [10] that, for any finite structure B, there is an algebraic 
characterization of AC: AC decides CSP(B) if and only if there is a homomorphism from p(B) to B. 

It is well-known that for a finite structure B, whether or not instances A of CSP(B) have the ACC 
can be checked in polynomial-time. The algorithm for this is called the arc consistency procedure, and 
it can be implemented in linear time and linear space in the size of A; note that we consider B to be 
fixed. The same holds for many infinite-domain constraint languages, for example for all cj-categorical 
constraint languages. Since this is less well-known, and requires a slightly less standard formulation of 
the arc consistency procedure, we present a formal description of the algorithm that we use; this algorithm 
can be applied for any (finite- and infinite-domain) constraint language that has finitely many pp-definable 
unary relations in B. 

We assume that B contains a relation for each unary primitive positive definable relation in B. This is 
not a strong assumption, since we might always study the expansion B' of B by all such unary relations. 
Then, if we are given an instance A of CSP(B), we might run the algorithm for CSP(B') on the expansion 
A' of A that has the same signature as B' and where the new unary relations are interpreted by empty 
relations. It is clear that a mapping from ^4 to i? is a homomorphism from A to B if and only if it is a 
homomorphism from A' to B'. 

To conveniently formulate the algorithm, we write R^(x) for the relation symbol of the relation that 
is defined by a pp-formula (t){x) in B. We write QaCIai, • • • , a/}) for the conjunction over all formulas 
of the form S{ai) where S' is a unary relation symbol such that Oi G S^. The pseudo-code of the arc 
consistency procedure can be found in Figure 1 . 

The space requirements of the given arc consistency procedure are clearly Unear. It is also well-known 
and easy to see that the procedure can be implemented such that its running time is linear in the size of the 
input. 

Proposition 4 Let B be a structure with finitely many primitive-positive definable unary relations. Then a 
given instance A ofCSP^B) has the ACC if and only if the arc consistency procedure presented in Figure 1 
does not reject. 



4 



Arc ConsistencyB(A) 






Input: a finite relational structure A. 






Do 






For every relation symbol R, every tuple (fti , 


. . . , ft;) G -R , and every i G -[1, . . 




Let (j) be the formula 3ai, . . . , Oj-i, ai+i, 


. . . , ai.R{ai, ...,ai) A QaHui, . 


■,ai}) 


If (p defines the empty unary relation over 


B then reject 




Else, add ai to i?^ 






Loop until no relation in A is changed 







Figure 1: The arc consistency procedure for CSP(B), where B contains all primitive positive definable 
unary relations in B. 

In particular, we can apply the algorithm shown in Figure 1 to all constraint satisfaction problems 
with an w-categorical constraint language. However, it was shown that in this case the algorithm cannot 
be used as a decision procedure for CSP(B) (i.e., that rejects an instance A if and only if it does not 
homomorphicaUy map to B), unless B is homomorphicaUy equivalent to a finite structure [4]. 

4 Peek Arc Consistency 

We present basic definitions and results concerning peek arc consistency. The following two definitions are 
analogous to Definitions 1 and 2 of the previous section. 

Deftnition 5 An instance (A, B) of the CSP has the peek arc consistency condition (PACC) if for every 
element a & A, there exists a homomorphism hfrom A to p(B) such that h{a) is a singleton. 

Definition 6 We say that peek arc consistency (PAC) decides C>S'P(B) if for all finite structures A, the 
following holds: (A, B) has the PACC implies that A ^ B. 

The converse of the condition given in this definition always holds. Suppose that A ^ B; then, the 
mapping taking each a £ Atothc singleton {h{a)} is a homomorphism from A to p(B) (Proposition 3), 
and hence (A, B) has the PACC. 

We now present an algorithm that decides for a given instance A of CSP(B), whether (A, B) has the 
PACC. We assume that B has a finite number of orbits and pp-definable binary relations. This holds in 
particular for all w-categorical structures. The following lemma then allows us to use the arc consistency 
procedure presented in Figure 1 for every expansion of B by singletons. 

Lemma 7 Let 'B be a structure with finitely many pp-definable binary relations. Then every expansion of 
Bby a constant has finitely many pp-definable unary relations. 

Proof. Suppose for contradiction that for a constant 6, there are infinitely many pairwise distinct unary 
relations with a pp-definition in the expansion of B with {b). For each such definition, if we replace the 
occurrences of the relation symbol for the singleton {6} by a new variable, we obtain formulas that are 
pp-definitions in B of pairwise distinct binary relations. □ 

Proposition 8 Let 'B be a structure with finitely many orbits and finitely many pp-definable binary rela- 
tions. Then a given instance A ofCSPCB) has the PACC if and only if the algorithm presented in Figure 2 
does not reject A. 

Proof. Suppose that A is an instance of CSP(B) that has the PACC. We have to show that for any element 
a from A there exists an orbit O of B such that for any choice of 6 e O the arc consistency procedure that is 
called in the inner loop of the algorithm in Figure 2 does not reject the instance [A, {a}] of CSP([B, {6}]). 
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Peeks (A) 

Input: a finite relational structure A. 

For every element a of A 
For all 6 e {61,..., 6;} 

Run the arc-consistency procedure on the instance [A, {a}] of CSP([B, {b}]) 
If the arc consistency procedure rejects for pp-definable sets, reject 



Figure 2: The peek arc consistency procedure for CSP(B) for structures B that have finitely many orbits 
and pp-definable binary relations. Let &i, .... 6; be arbitrary representatives from the orbits in B, i.e., we 
assume that there are / orbits and 61, . . . , 6; are in pairwise distinct orbits. 

Because A has the PACC, there exists a homomorphism from A to p(B) such that h{a) is a singleton 
{c}. Let O be the orbit of c, and let b be the element from O that is used by the algorithm. We know 
that there exists an automorphism a that maps c to b. Clearly, the mapping h' defined by x a{h{x)) 
is a homomorphism from A to p(B) such that h'{a) = {b} is a singleton. By Lemma 7 the structure 
[B, {b}] has finitely many pp-definable unary relations. Proposition 4 then shows that the arc consistency 
procedure does not reject the instance [A, {a}] of CSP([B, {6}]). All the impUcations in this argument can 
be reversed, which shows the statement of the proposition. □ 

Theorem 9 Let ^ be a structure with finitely many orbits and finitely many pp-definable binary relations, 
and suppose that PAC solves C5'P(B). Then there exists a quadratic-time and linear-space algorithm that 
decides C>S'P(B). Moreover, CSP{S) can be decided in linear time with a linear number of processors. 

Proof. Let A be an instance of CSP(B). If the algorithm in Figure 2 rejects A, then it does not have the 
PACC and hence A does not homomorphically map to B. If the algorithm in Figure 2 does not rejects A, 
then A has the PACC. By assumption, PAC solves CSP(B), and therefore there exists a homomorphism 
from A to B. 

Because the arc consistency procedure uses linear space, the algorithm in Figure 2 can be implemented 
in hnear space as well. The arc consistency procedure is called a linear number of times (recall that B is 
fixed and not part of the input). Because the arc consistency procedure can be implemented such that it 
uses linear time, the overall running time on a sequential machine is quadratic in the worst case. However, 
note that each application of the arc consistency procedure can be performed on a different processor. □ 

5 Algebraic Characterization 

In this section we present a general algebraic characterization of those constraint languages where PAC 
decides CSP(B) for an arbitrary finite or infinite structure B. 

We use the notation Ind(p(B)") to denote the induced substructure of p(B)" whose universe contains 
an n-tuple of p(B)" if and only if at least one coordinate of the tuple is a singleton. 

Theorem 10 Let 'B be a structure. PAC decides CSP{B) if and only if for all n there is a homomorphism 
from all finite substructures c»/Ind(p(B)") to B. 

Proof. (^): Suppose that (A, B) has the PACC. Then, by definition of the PACC, for all a e A, there is a 
homomorphism ha from [A, a] to [p(B), {{b} \ b G B}]. Let n = \A\. Now consider the homomorphism 
h from A to p(B)" defined by h{x) = Ila£Aha{x). Notice that for every o e ^, the element ha{a) 
of the tuple h{a) is a singleton, and hence h is in fact a homomorphism from A to lnd(p(B)"). Let C 
be the structure that is induced by the image of /i in Ind(p(B)"). Since C is finite, it is by assumption 
homomorphic to B, and by composing homomorphisms we obtain that there is a homomorphism from A 
toB. 
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(=>): Let n > 1, and let C be a finite substructure of Ind(p(B)"). We have to show that C, viewed as 
an instance of CSP(B), has the PACC, which suffices by assumption. Let a be any element of the universe 
of C. By definition of C, we have that a is an n-tuple such that some coordinate, say the ith coordinate, is 
a singleton. The projection function tt, is a homomorphismfrom [C, {a}] to [p(B), {{6} | b G B}]. □ 

6 Robustness 

In this section, we demonstrate that the class of structures B such that PAC decides CSP(B) is robust in 
that it satisfies certain closure properties. 

We first investigate expansion by pp-definable relations. Say that a structure B' (over a') is a pp- 
expansion of B (over a) if B' is an expansion of B and for every symbol i? e cr' \ ct, it holds that is 
pp-definable over B. 

Theorem 11 Suppose that PAC decides CSP{S). Then for any pp-expansion B' o/B, it holds that PAC 
decides CSP{B'). 

Proof. It suffices to show that the theorem holds for an expansion of B by (1) an intersection of existing 
relations, (2) a projection of an existing relation, (3) a product of an existing relation with B, or (4) the 
equality relation. In each of these cases, we will consider an expansion B' of B where the signature of B' 
has an additional symbol T. We will use a to denote the signature of B, and so the signature of B' wiU be 

au{r}. 

By Theorem 10, it suffices to show that for every n > 1 and for all finite substructures C of Ind(p(B')") 
there exists a homomorphism h from C to B'. Let C be a finite subset of the universe of Ind(p(B)"), let 
C be the induced substructure of Ind(p(B)") with universe C, and let C be the induced substructure of 
Ind(p(B')") with universe C. By Theorem 10, there is a homomorphism h from C to B. Since B' is an 
expansion of B with just one additional symbol T, it suffices to show that h{T^ ) C . 

(1) : Suppose that T^' =R^r\S^ for R,S €a. It follows that T^^^') C n S>^^^\ from which 
we obtain T^' CR^nS*^. For any tuple t e T^', we thus have h{t) G R^Ci S^, and hence h{t) € T^'. 

(2) : Suppose that is the relation 

{{ti,...,tk) I 3tk+i,...,tk+i sothat(ti,...,ife+;) e R^} 

that is, is the projection of R^ onto the first k coordinates; we denote the arity of R G ahy k + 1. (We 
assume that the projection is onto an initial segment {1, .... A;} of coordinates for the sake of notation; a 
similar argument holds for an arbitrary set of coordinates.) We have that T*^'^ ' is the projection of R^^^^ 
onto the first k coordinates. Thus, for any tuple t G T'-' , we have that h{t) is the projection of a tuple in 
R^, and is hence in . 

(3) : Suppose that T^' = R^ x B for R e a. Lett= {h, . . . , tk+i) be any tuple in T^'. We have 
that {ti, . . . ,tk) G i?*^, and hence h{ti, . . . ,tk) G R^. Since we have h{tk+i) G B, it follows that 
/i(<i,...,tfc+i) GT^'. 

(4) : Suppose that T^' = {{b, b) \ b e B}. For any tuple (ti, G T«'^'^'\ we have h = t2. For any 
tuple (ti, i2) G T*^' we thus also haveti = t2, and we have h{ti,t2) G T^'. □ 

We now consider homomorphic equivalence. 

Theorem 12 Let B be a structure. Suppose that PAC decides C5f (B) and that B' is a structure that is 
homomorphically equivalent to B, that is, B — > B' and B' B. Then PAC decides CSPCB'). 

We first estabUsh the following lemma. 

Lemma 13 Let f be a homomorphismfrom B' to B. The map f defined on p{B') \ {0} by f'{U) = 
{f{u) \ u gU} is a homomorphismfrom p(B') to p(B). 

Proof. Let i? be a symbol, and let t be a tuple in Rf^'^'\ We have t = {ttiS, . . .,TTkS) where S C R^ and 
5/0. Define S' = {/(s) | s G 5}. We have S' C R^' . As f'{t) = {mS', . . .,TTkS'), the conclusion 
follows. □ 
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Proof. (Theorem 12) Suppose that (A, B') has the PACC. We want to show that A B'. 

We first show that (A, B) has the PACC. Let a be an element of A. There exists a homomorphism h 
from A to p(B') such that h{a) is a singleton. The mapping /' given by Lemma 13 is a homomorphism 
from p(B') to p(B) that maps singletons to singletons. Hence, the map a — > f'{h{a)) is a homomorphism 
from A to p(B) mapping a to a singleton. We thus have that (A. B) has the PACC. 

Since PAC decides CSP(B), there is a homomorphism from A to B. By hypothesis, there is a homo- 
morphism from B to B', and so we obtain that A is homomorphic to B. □ 

7 Tractability by PAC 

Slice-semilattice operations. We first study a class of ternary operations. Recall that a semilattice 
operation is a binary operation that is associative, commutative, and idempotent, and that a semilat- 
tice operation © is well-defined on finite sets, that is, for a finite set S = {si, . . . , s„} we may define 
(B{S) = ©(©(. . . © (©(si, S2), S3), . . .), s„). We say that a ternary operation t : ^ B is a slice- 
semilattice operation if for every element b G B,the binary operation ©(, defined by (Bb{x, y) = t{x, y, b) 
is a semilattice operation. These ternary operations have been studied in [5]; there, the following examples 
were presented. 

Example 14 Let Bbea set, and letd : B^ ^ B be the operation such that d{x, y, z) is equal to x ifx = y, 
and z otherwise. This operation is known as the dual discriminator on B, and is an example of a slice- 
semilattice operation. For examples of constraint languages that have a dual discriminator polymorphism, 
see e.g. [14]. 

Example 15 Let B be a subset of the rational numbers, and let median : B^ B be the ternary oper- 
ation on B that returns the median of its arguments. (Precisely, given three arguments xi, xi, and 2:3 in 
ascending order so thatx\ <X2< x^, the median operation returns X2.) This operation is an example of 
a slice-semilattice operation. 

Theorem 16 Let B be a finite structure that has a slice-semilattice polymorphism. Then, the problem 
CSP{B) is tractable by PAC. 

Proof. Let / denote the slice-semilattice polymorphism. By Theorem 10, it suffices to show that for every 
finite substructure C of Ind(p(B)") there is a homomorphism h from C to B. 

For each element {Si, . . . , Sn) in C we define h{Si, . . . , Sn) as follows. Let g be the maximum 
index such that Sg is a singleton; we are guaranteed the existence of such an index by the definition of 
Ind(p(B)"). We define a sequence of values 6g, . . . , 6„ e B inductively. Set bg to be the value such that 
{bg} = Sg. For i with g < i < n, define 6, = (Bbi^iSi. We define h{Si, . . . , Sn) = bn. 

We now show that h is in fact a homomorphism from C to B. Let B be any symbol of arity k. Suppose 
that {{Si, ...,S^),..., {S^, 5^)) e Eind(p(B)") jjggjjg ^ sequence of tuples IT, • • • e i?^ in 
the following way. Let ti be any tuple such that h e (S'|x--xS'f)n . For i with 1 < i < n, we define 
Ti = {®t^i_i^iSl, ©*(,_!);, S'f). Given that is in R^, we prove that is in R^. Let Cj C i?^ be a 
set of tuples such that (tti (Q), . . . , Trk{Ci)) — {S^, . . . ,S^). Let c^, . . . , c™ with m > 2 be a sequence of 
tuples such that {c? , . . . , c™} = C^. We have U = /(c^, • • • /((?, t?, U~[),U~[) tP[). Since 
/ is a polymorphism of R^, we obtain ti e R^. 

Observe now that for each tuple (5f , . . . , S-j^), the values bg, . . . ,bn that were computed to determine 
h{S{, . . ., S^) = bn have the property that for each i with g < i < n, bi = tij. It follows that h is the 
desired homomorphism. □ 

It is well-known that the problem 2-SAT can be identified with the problem CSP(B) for the structure 
B with universe B = {0, 1} and relations 

Rf,o) = {o^v\mo)} 

i?foi)={0,lp\{(0,l)} 

^fi,i) = {o,ir\{(i,i)} 
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It is known, and straightforward to verify, that the dual discriminator operation on {0, 1} is a polymorphism 
of this structure B. We therefore obtain the following. 

Theorem 17 The problem 2-SAT is tractable by PAC. 

Let fT be the signature {E} where S is a symbol having arity 2. We call a structure G over a an 

undirected bipartite graph if E*^ is a symmetric relation, the universe G of G is finite, and G can be 
viewed as the disjoint union of two sets Vb and Vi such that E^ C (Vo x Vi) U (Vi x Vb). 

Theorem 18 Let G be an undirected bipartite graph. The problem CSP(G) is tractable by PAC. 

Proof. Let G' be the bipartite graph with universe {0, 1} and where E^' = {(0, 1), (1,0)}. As E'^' 
is pp-definable over the structure B corresponding to 2-SAT above, by 0(t'i,t'2) = R(o,i){vi,V2) A 
-R(o,i)(^'2, vi), we have that PAC decides CSP(G') by Theorem 11. 

If E'^ is empty, the claim is trivial, so assume that (.s, s') e E'^. We claim that G and G' are 
homomorphically equivalent, which suffices by Theorem 12. The map taking ^ s and 1 ^ s' is a 
homomorphism from G' to G. The map taking all elements in Vb to and all elements in Vi to 1 is a 
homomorphism from G to G'. □ 

We call a finite structure D over signature {A} where ^ is a binary relation symbol an orientation of 
a cycle if D can be enumerated as di, . . . , c?„ such that A^ contains either {di, di+i) or {di+i,di) for all 
1 < i < n, contains either {dn,di) or {dn, di), and contains no other pairs. The orientation of a cycle is 
called unbalanced if the number of elements A^ of the form [di, dj+i) or {dn, d\) is distinct from n/2. It 
has been shown in [9J that for every unbalanced orientation of a cycle D there is a Unear order on D such 
that D is preserved by the median operation with respect to this Unear order. 

We therefore have the following result. 

Theorem 19 Let D be an unbalanced orientation of a cycle. Then CSP(D) is tractable via PAC. 

The Pomt Algebra in Temporal Reasoning. The structure (Q, <, 7^) is known as the point algebra in 
temporal reasoning. The problem CSP(Q, <, ^) can be solved by the path-consistency procedure [16]. 

Theorem 20 CSP{Q, <, ^) is tractable via PAC. 

Proof. Clearly, the structure (Q; < , 7^) has only one orbit. It is well-known that it is also w-categorical [12], 
and therefore has in particular a finite number of pp-definable binary relations. To apply Theorem 9, we 
only have to verify that PAC decides CSP(Q; <, 7^). 

Let A be an instance of CSP(Q; <, 7^). We claim that if there is a sequence ai, . . . ,ak € A such that 
(fli, fli+i) G <^ for all 1 < i < fc, (ofe, ai) E <''^, and (op, a^) G for some p,q E {1, . . . , k}, then 
there is no homomorphism from A to p(B) such that h{ai) is a singleton {bi}. Suppose otherwise that 
there is such a homomorphism h. By the definition of p(B) there must be a sequence h\,...,bk such that 
hi G h{ai) for all 1 < i < fc and 5i+i) G <^ for all 1 < z < n. Moreover, (6fe, hi) E <^, and hence 
h\ = ■ ■ ■ = hk- But then we have {h{ap), h{aq)) = {hi, hi) G 7^^, a contradiction. Hence, the structure A 
does not have the PACC if A has such a sequence ai, . . . , a^. It is known [16] that if A does not contain 
such a sequence, then A ^ (Q; <, 7^). This shows that PAC decides CSP(Q; <, 7^). □ 

Set Constraints. Reasoning about sets is one of the most fundamental reasoning tasks. A tractable set 
constraint language has been introduced in [8]. The constraint relations in this language are containment 
X C Y ('every element of X is contained in Y'), disjointness X\\Y ('X and Y do not have common 
elements'), and disequality X ^ Y {'X and Y are distinct'). In the CSP for this constraint language we 
are given a set of constraints and a set of containment, disjointness, and disequality constraints between 
variables, and we want to know whether it is possible to assign sets (we can without loss of generality 
assume that we are looking for subsets of the natural numbers; note that we allow the empty set) to these 
variables such that all the given constraints are satisfied. It was shown in [3] that this problem can be 
modeled as CSP((D; C, ||, ^)), where D C 2^^ is a countably infinite set of subsets of N, and such that 
(-D; C, II, ^) is w-categorical and has just two orbits (the orbit for 0, and the orbit for all other points). 
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Theorem 21 CSP{{D; C,\\,jt))is tractable via PAC. 

Proof. Because (I?; C , 1 1 , is tj-categorical, it suffices as in the proof of Theorem 20 to verify that PAC 
decides CSP((Z); C, \ \,^)) in order to apply Theorem 9. 

Let A be an instance of CSP((Z); C, ||, ^)). We claim that if there are four sequences {a\, . . . , a^^), 
. .., {af , ■ ■ ■ , ) of elements from A such that 

• «fci = Ofc. =al= a\, 

• (a^, aj+i) e for all 1 < j < 4, 1 < i < kj, 

• {a\,a\) and 

then there is no homomorphism h from A to p(B) such that h{a\) is a singleton {h\}. Suppose otherwise 
that there is such a homomorphism h. By the definition of p(B) there must be sequences of elements 
(6i,...,6iJ,...,(&t,...,64jsuchthat 

• 6^ e h{a'i) for all 1 < j < 4, 1 < i < fcj, 

• = bl =bl = hi 

• iH^ H+i) e for all 1 < j < 4, 1 < i < kj, 

The third item and the fourth item together imply that b].^ = ^ ^ '■^y ^^^^ contains two distinct 
sets cannot be empty). The third item and the fifth item together imply that = = (any set that is 
contained in two disjoint subsets must be the empty set), a contradiction. 

It follows from Lemma 3.7. in [8] that if A does not contain such sequences, then A — > {D; C, 1 1, ^). 
This shows that PAC decides CSP{D; C, 1 1, ^). □ 

PAC tractability results can also be shown for the basic binary relations in the spatial reasoning formal- 
ism of RCC-5 [13], which is closely related to set constraints, but also for other known tractable spatial 
constraint satisfaction problems in qualitative spatial reasoning, e.g., in [6]. 
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